Goal-Directed Learning: A Decision-Theoretic Model for Deciding What to Learn Next

Author

  • Marie desJardins
Abstract

This paper describes Goal-Directed Learning (GDL), a theory that uses the principles of decision theory to choose learning tasks. The expected utility of being able to predict various features of the environment is computed, and the features with the highest expected utility can be used as learning goals, which an agent's inductive mechanism should form theories to predict. We present a general decision-theoretic formula for the utility of learning goals, formalizing the idea that the best learning goals are those which, if learned, would maximize the agent's expected utility. The performance element of PAGODA (Probabilistic Autonomous GOal-Directed Agent), an autonomous agent design presented in (desJardins 1992), is described, and a formula is given for computing the utility of learning goals in PAGODA.
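
The selection idea in the abstract can be illustrated with a minimal sketch. This is not the paper's formula or PAGODA's implementation: the `LearningGoal` type, its two utility fields, and the numbers below are hypothetical, and the gain is simply assumed to be the difference between the agent's expected utility with and without the ability to predict a feature.

```python
from dataclasses import dataclass


@dataclass
class LearningGoal:
    """A candidate feature the agent could learn to predict (hypothetical type)."""
    feature: str
    utility_if_learned: float  # assumed expected utility once the feature is predictable
    utility_baseline: float    # assumed expected utility without that ability


def goal_value(goal: LearningGoal) -> float:
    """Expected gain from learning to predict this feature (assumed definition)."""
    return goal.utility_if_learned - goal.utility_baseline


def choose_learning_goals(goals, k=1):
    """Return the k candidates with the highest expected gain."""
    return sorted(goals, key=goal_value, reverse=True)[:k]


# Made-up illustrative numbers, not taken from the paper.
candidates = [
    LearningGoal("food_location", 10.0, 4.0),
    LearningGoal("light_level", 5.0, 4.5),
    LearningGoal("obstacle_ahead", 8.0, 3.0),
]
best = choose_learning_goals(candidates, k=1)[0]
print(best.feature, goal_value(best))
```

Ranking by the difference rather than by `utility_if_learned` alone keeps the agent from picking features that are valuable but that it can already handle well.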


Similar resources

Utilizing Generalized Learning Automata for Finding Optimal Policies in MMDPs

Multi-agent Markov decision processes (MMDPs), the generalization of Markov decision processes to the multi-agent case, have long been used for modeling multi-agent systems and serve as a suitable framework for multi-agent reinforcement learning. In this paper, a generalized learning automata based algorithm for finding optimal policies in MMDPs is proposed. In the proposed algorithm, MMDP ...

An Introduction to Inference and Learning in Bayesian Networks

Bayesian networks (BNs) are modern tools for modeling phenomena in dynamic and static systems and are used in different subjects such as disease diagnosis, weather forecasting, decision making and clustering. A BN is a graphical-probabilistic model which represents causal relations among random variables and consists of a directed acyclic graph and a set of conditional probabilities. Structure...

Learning Causal Bayesian Networks from Observations and Experiments: A Decision Theoretic Approach

We discuss a decision theoretic approach to learn causal Bayesian networks from observational data and experiments. We use the information of observational data to learn a completed partially directed acyclic graph using a structure learning technique and try to discover the directions of the remaining edges by means of experiment. We will show that our approach allows to learn a causal Bayesia...

Experiences of nursing students of evidence-based practice education according to Rogers’ diffusion of innovation model: a directed content analysis

Introduction: Evidence based practice (EBP) education is essential in promoting clinical care, but an effective educational strategy for teaching EBP in nursing faculties is not available. The aim of this study was to explore the experiences of nursing students of EBP education according to Rogers’ Diffusion of Innovation Model. Methods: This qualitative study was carried out using a directed conte...

Lifelong, self-directed learning and the maintenance of competence: the triple helix of continuing professional development

Abstract It has been proposed that we think of continuing medical education (CME) as a two-stranded helix, in which one strand represents the internal characteristics of the learner-physician, the other strand the culture and environment in which he or she practices and lives. In many countries, the product of these two strands has been increasingly termed ‘continuing professional development’...



Publication date: 1992